Segmented video preview controls by remote participants in a video communication session

ABSTRACT

A method allows a receiving device to control aspects of an image stream transmitting (IST) device. The method includes receiving a segmented video stream from the IST device during a video communication session. The segmented video stream includes a primary segment and a secondary segment presenting a virtual control interface. The method includes presenting, on a display device, a control enabling view of the segmented video stream and monitoring for movement, captured by at least one image capturing device, that is indicative of the input occurring within a spatial location proximate to the selectable feature. The method includes in response to detecting the input, identifying a function associated with the selectable feature and activating the function to affect one or more characteristics of a video image presented within the primary segment.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is related to application Ser. Nos. 17/826,603,17/826,639 and 17/826,683 filed on an even date herewith, the contentsof which are fully incorporated herein by reference.

BACKGROUND 1. Technical Field

The present disclosure generally relates to electronic devices withvideo capturing capabilities and in particular to electronic deviceswith video capturing capabilities that are utilized within a videocommunication session.

2. Description of the Related Art

Modern smartphones are equipped with some of the best digital cameras inthe consumer market. Smartphones are often utilized to enable videocommunication sessions with a second device, where the front camera isused to focus on the user while the display presents the video receivedfrom the second device. However, given the limitations in screen size,persons engaged in video communication sessions, such as a videoconference, often prefer to use their laptops or desktop devices thatpresent a much larger display for viewing the received video within thecommunication application interface that can also present one of morecommunication application settings icons/menu items.

Unlike the high-quality inbuilt cameras of the smartphones, the built-inwebcams of laptops and external universal serial bus (usb)-connectedwebcams are very basic and provide limited video options for the userwho utilizes their personal computer (PC) or laptop to engage in a videocommunication session. Because of the low quality of the video imagescaptured by these webcams, recent solutions have been provided forsmartphone users to utilize the better-quality cameras available withtheir smartphones to capture the video that is transmitted as the videofeed in video communication sessions with other devices. The smartphonesare wired or wirelessly connected to the PC, which provides a largerdisplay device for video review and viewing.

BRIEF DESCRIPTION OF THE DRAWINGS

The description of the illustrative embodiments can be read inconjunction with the accompanying figures. It will be appreciated thatfor simplicity and clarity of illustration, elements illustrated in thefigures have not necessarily been drawn to scale. For example, thedimensions of some of the elements are exaggerated relative to otherelements. Embodiments incorporating teachings of the present disclosureare shown and described with respect to the figures presented herein, inwhich:

FIG. 1 illustrates an example video communication session environmenthaving a primary electronic device exchanging a segmented video feedwith at least one second electronic device via a video communicationsession, according to one or more embodiments;

FIG. 2A depicts an example communication device with an attachedexternal display and within which various aspects of the disclosure canbe implemented, according to one or more embodiments;

FIG. 2B is a block diagram of example contents of the system memory ofthe example communication device of FIG. 2A, according to one or moreembodiments;

FIG. 3 depicts a settings user interface for a video communicationapplication operating on the communication device and presenting a userselectable option for activating local video segmenting features of anartificial intelligence (AI) engine, in accordance with one or moreembodiments;

FIG. 4A provides an example illustration of a presentation by anelectronic device of a capture video image of a local scene within auser interface of a video communication application, the user interfacepresenting a selectable segmenting feature, according to one or moreembodiments;

FIGS. 4B-4C illustrates the segmenting of a preview of captured videoimages into a plurality of segments, including a primary segmentencompassing a primary region of interest (ROI) and individual, verticaland horizontal segments of the video image, with each vertical segmentproximate to a selection object associated with one of the devicesettings and/or application settings, according to one or moreembodiments;

FIG. 5A illustrates individual, physically separated, vertical segmentsof the video image, surrounding a central, primary segment that includesthe participant's upper torso and face as the ROI, with at least onevertical segment presenting an action center within selectablefunctions/features superimposed over the original side segment to enableselection via air gesturing at a normal spatial position of theparticipant's hand, according to one or more embodiments;

FIG. 5B illustrates individual, physically separated, vertical segmentsof the video image, surrounding a central, primary segment that includesthe participant's user's upper torso and face, with at least onevertical segment presenting a virtual whiteboard that can be written onvia air/finger gesturing, according to one or more embodiments;

FIGS. 6A-6C and 7A-7C presents pairing of the video image presented by alocal communication device display and on a remote participant's displayfollowing segmentation of the video image and local participantselection of background features from the action center, according toone or more embodiments;

FIG. 8 illustrates a different set of example control functions,including a sliding bar selection, provided within an action center withvirtual user interface objects overlaying a side segment, according toone or more embodiments;

FIG. 9 illustrates an example of a feature for applying changes in thevideo image background effect to only the primary segment that is beingtransmitted to the video communication session, according to one or moreembodiments;

FIGS. 10A-10B (collectively FIG. 10 ) depict a flowchart of a method bywhich an electronic device performs segmenting of a locally capturedvideo image presented within a preview of the video image thatincorporates an action center with a virtual user interface for receiptof user selections via air gestures, according to one or moreembodiments;

FIG. 11 illustrates an example selection table that is generated andpresented on a display of a local participant to enable selection ofwhich remote, second participant on the video communication session isprovided access to the action center features that control/manipulatecharacteristics of the primary segment on the local device that islocally presented and/or presented within the video feed to otherremote, second participants, according to one or more embodiments;

FIG. 12 illustrates components of a video communication environment inwhich a receiving second device receives a video feed from and imagestream transmitting (IST) device along with virtual action centers thatenable remote modification of the image feed, according to one or moreembodiments;

FIG. 13 illustrates components of a video communication environment inwhich a receiving second device remotely modifies the content and/orcharacteristics of at least the primary segment of a video image that isreceived from the IST device, according to one or more embodiments;

FIG. 14 depicts a flowchart of a method by which an electronic deviceselectively transmits different combinations of primary segments of asegmented local video image and one or more virtual interfaces of actioncenters associated with modifying one or more characteristics of theprimary segment and generating visible feedback of which second devicesreceived the enhanced action center functions, according to one or moreembodiments; and

FIG. 15 depicts a flowchart of a method by which an electronic devicereceives, via a video communication session, a video feed with asegmented video image from an IST device and performs control operationson one or more characteristics of the video image at the IST deviceusing a received and displayed virtual interface of an action centerassociated with the received video image, according to one or moreembodiments.

DETAILED DESCRIPTION

An electronic device, a method, and a computer program product providesconcurrent transmission of a virtual action center with a primarysegment cropped from a video image, enabling remote participant controlof aspects of the primary segment. The method includes identifying aprimary region of interest (ROI) within a video image and delineatingthe video image into a primary segment encompassing the primary ROI anda secondary segment. The method includes associating a virtual actioncenter within a location of the secondary segment and assigning controlfeatures of the virtual action center to the second device connected viaa video communication session. The method includes transmitting theprimary segment and concurrently transmitting the features/functions ofthe virtual action center to the second device, enabling the seconddevice to concurrently present, on its display, the primary segment andthe virtual action center as portions of a received video stream,providing second device control of content within the primary segment.

According to a first aspect of the disclosure, an electronic device, amethod, and a computer program product provides segmenting of a videoimage into a primary and secondary segments and presentation of atouchless action center in a display area of one or more of thesecondary segments to support touchless user interfacing and controlduring a preview of the video image.

In a first embodiment of the first aspect, an electronic device includesat least one image capturing device that captures video of a local sceneand an interface by which the electronic device connects to a displaydevice. The electronic device also includes a memory having storedthereon a video image segmentation and control module (VISCM) thatprovides program instructions for spatially delineating video imagesinto two or more segments that can be individually presented within orexcluded from a video preview and/or video feed. The electronic devicealso includes at least one processor communicatively coupled to thedisplay device, to each of the at least one image capturing device, andto the memory. The at least one processor executes the programinstructions of the VISCM, which causes the at least one processor toidentify a primary region of interest (ROI) within a video imagecaptured by the at least one image capturing device, the primary regionof interest being a smaller area than the video image. The at least oneprocessor delineates the video image into two or more segments,including a primary segment that encompasses the primary ROI and atleast a secondary segment. The at least one processor then associates atouchless action center with a virtual interface with a location of thesecondary segment in the video image. The virtual interface includes atleast one feature that can be selected via one of air gestures andscreen touches during presentation of the primary segment of the videoimage. The at least one processor then presents a preview of thedelineated segments of the video image on the display device with atleast the primary segment and the secondary segment of the video imageand the virtual interface being visible within the display device.

According to one or more embodiments, the processor is furtherconfigured to detect, within a captured video image, one or more airgestures proximate to a location of a specific feature among the atleast one feature within the virtual interface, the one or more airgestures representing a selection of the specific feature. The at leastone feature comprises at least one of image capturing deviceconfiguration and settings, video image presentation settings, localdevice settings, and application settings for a video conferenceapplication enabling a video communication session with at least thesecond device. The processor is further configured to, in response toidentifying the selection of the specific feature, enable acorresponding function associated with the specific feature concurrentlywith a capturing and transmission of at least the primary segment of thevideo image.

According to one or more embodiments, the processor is furtherconfigured to identify selection of the specific feature, whichactivates a function that modifies one or more characteristics of atleast the content within the primary segment of the video image, applythe function to an original content within at least the primary segmentto generate a resulting modified content, and locally present andremotely transmit the resulting modified content within the primarysegment in place of original content.

According to one or more embodiments, the processor is furtherconfigured to transmit only content within the primary segment to acommunicatively-connected second device to which the video image is tobe shared. The secondary segment(s) and the associated virtualinterfaces of the touchless action center are only presented during thevideo preview on the local display device.

Additional embodiments provide a method and a computer program productthat provide similar functions as the described embodiments of theelectronic device.

According to a second aspect of the disclosure, an electronic device, amethod, and a computer program product provides granular transmission ofselect segments of a segmented video image along with a touchless actioncenter to at least one second device among a plurality of second devicesthat are communicatively connected by a video communication session toenable the at least one second device to control one or more features orfunctions associated with the video feed being transmitted by theelectronic device.

In a first embodiment of the second aspect, the electronic device alsoincludes a communication interface that enables the electronic device tocommunicatively connect and exchange video data with a plurality ofsecond devices during a video communication session. The at least oneprocessor is communicatively coupled to the display device, to each ofthe at least one image capturing device, to the communication interface,and to the memory. The processor processes instructions of the VISCM,which causes the at least one processor to identify a primary region ofinterest (ROI) within a video image captured by the at least one imagecapturing device, the primary region of interest being a smaller areathan the video image. The processor delineates the video image into twoor more segments, including a primary segment that encompasses theprimary ROI and at least one secondary segment. The processorlinks/presents/associates an action center with a virtual interfacewithin a location of one or more of the at least one secondary segmentin the video image. The virtual interface includes at least one featurethat can be selected via one of air gestures and screen touches duringpresentation of the primary segment of the video image. The processorassigns control features of the virtual interface with at least oneselected second device from among the plurality of second devices. Theprocessor transmits, to each of the plurality of second devices, atleast the primary segment for displaying at a second display of acorresponding second device during the video communication session. Theprocessor concurrently transmits, to each of the at least one selectedsecond device, the control features of the virtual interface assignedwith the at least one selected second device to trigger the at least oneselected second device to present the virtual interface on a seconddisplay device, concurrently with presenting the primary segment asparts of a combined video stream from the electronic device.Accordingly, each of the plurality of second devices can be transmitteda different combination of segmented video images comprising at leastthe primary segment with zero or more virtual interfaces remotelypresented on a respective second display for selection at a respectiveat least one selected second device.

In one or more embodiments, to associate the virtual interface with theat least one selected second device from among the plurality of seconddevices, the at least one processor presents a conference participantinterface that receives a selection ranging from between none (zero) toone or more virtual interfaces to provide to a particular second deviceidentified via an associated participant name. And, in response toreceiving the selection, the at least one processor associates aselected one or more virtual interfaces with the particular seconddevice. The at least one processor presents, on the local displaydevice, a local preview window including a visual indication of which ofthe one or more virtual interfaces are associated for transmission toeach of the plurality of second devices designated to receive at leastone virtual interface within a transmitted video stream.

According to one or more embodiments, the at least one processor detectsselection of the particular second device as an administrative deviceand auto-selects specific virtual interfaces having functionscontrollable by the administrative device. The at least one processorthen transmits the auto-selected virtual interfaces concurrently withthe primary segment to enable virtual interfaces to be visuallypresented within the second display of the particular second deviceconcurrently with the primary segment. The at least one processorfurther monitors for receipt of an indication of a remote selection oractivation of at least one control function from within the virtualinterfaces at the particular second device. And, in response toreceiving an indication of a remote selection that is associated with amodification of a characteristic of the video images within the primarysegment, the at least one processor applies the modification to videoimage content within the primary segment at the electronic device andtransmits the modification of the video image content to each of theplurality of second devices participating in the video communicationsession.

According to a third aspect of the disclosure, an electronic device, amethod, and a computer program product provides an electronic devicethat receives the primary segment of a segmented video image along witha touchless action center from a video communication session andcontrols features of the primary segment at an image stream transmitting(IST) device via the virtual interface of a touchless action center thatis also transmitted by the IST device to the electronic device. Thedevice includes a communication interface that enables the electronicdevice to communicatively connect with and receive video data from theIST device during a video communication session. The at least oneprocessor receives the segmented video stream with the primary segmentpresenting a region of interest captured at the IST device and the atleast one secondary segment, each presenting a respective virtualcontrol interface. The processor presents, on the local display device,a control enabling view of the segmented video stream, including boththe primary segment and the at least one secondary segment with therespective virtual control interface. Each virtual control interfacefurther presents at least one selectable feature that can be selected byan input from among a screen touch and an air gesture during localpresentation of the received segmented video stream. The processormonitors for movement captured by the at least one image capturingdevice that is indicative of the input occurring within a spatiallocation proximate to the selectable feature within a correspondingsecondary segment. And, in response to detecting the input, identify afunction associated with the selectable feature and activate thefunction to affect one or more characteristics of a video imagepresented within the primary segment.

In a first embodiment of the second aspect, the selectable featuremodifies at least one characteristic of content visibly presented withinthe primary segment, and the processor modifies the characteristic ofthe content in response to the input. In one or more embodiments, theprocessor modifies the characteristic of the content at the IST deviceby transmitting a corresponding content settings update indication tothe IST device via the video communication session. In one or moreembodiments, the processor modifies the characteristic of the contentfor at least one of: the electronic device only; and all connecteddevices participating in the video communication session that receivethe primary segment from the IST device.

According to a fourth aspect of the disclosure, an electronic device, amethod, and a computer program product provides transmission of selectsegments of a segmented video image along with a touchless action centerto a second device that is communicatively connected by a videocommunication session to enable the second device to control one or morefeatures or functions associated with the video feed being transmittedby the electronic device.

In a first embodiment of the fourth aspect, the electronic deviceincludes a memory having stored thereon a video image segmentation andcontrol module (VISCM) and a demonstration object preview module (DOPM).The VISCM includes first program instructions for spatially delineatingvideo images into two or more segments that can be individuallypresented or hidden within a video feed, and the DOPM includes secondprogram instructions for selectively reversing image mirroring topreviews of individual segments of a video feed. The processor processesthe first and second program instructions of the VISCM and the DOPM,which configures the at least one processor to identify a demonstrationobject within a video image captured by the at least one image capturingdevice, the demonstration object being in a defined area of the videoimage. The processor spatially segments the video image into two or moresegments, including a primary segment that encompasses the defined areawith the demonstration object and at least one secondary segmentencompassing a remaining portion of the video image. The processor thenpresents a preview of the delineated segments of the video image on thelocal display device, with the remaining portion of the video imagemirrored within the preview and at least the primary segment presentedwithout mirroring in a correct spatial location relative to theremaining portion of the video image. The person who is presenting thedemonstration object receives a non-mirrored, preview presentation ofthe demonstration object.

In a first embodiment of the fourth aspect, the processor spatiallysegments and reorients the primary segments relative to the mirroredview of the secondary segment in response to activation of ademonstration object preview (DOP) option within an applicationsupporting presentation of a camera image preview. In one or moreembodiment, the processor divides the remaining portion into at least asecond and a third segments, with the secondary segment encompassing aprimary region of interest and the third segment encompassing sectionsof the video image than can be selectively cropped from the remainingportion of the video image. The processor presents a virtual interfaceon the display device, visibly adjacent to or integrated as an overlayon top of the third segment. The processor presents, within the virtualinterface, the demonstration object preview (DOP) option, which isselectable via the virtual interface by one or more of an air gestureand a touch of a spatial area of the virtual interface on the displaydevice.

According to one or more embodiments, the processor reorients at leastone of an exterior boundary of the primary segment and a view angle ofprimary segment to present substantially seamlessly fit of the primarysegment into a space from which the primary segment is remove within theremaining portion of the video image during a segmentation and previewgeneration process.

Each of the described features and functions of the primary, second,third, and fourth aspects, which are presented as operations performedby the processor(s) of electronic devices are also described as featuresand functions provided by a plurality of corresponding methods andcomputer program products, within the various different embodimentspresented herein. In the embodiments presented as computer programproducts, the computer program product includes a computer readablestorage device having stored thereon program instructions or code that,when processed by at least one processor of an electronic device, suchas described above, enables the electronic device to complete thefunctionality of a respective one of the above-described processes.

The above contains simplifications, generalizations and omissions ofdetail and is not intended as a comprehensive description of the claimedsubject matter but, rather, is intended to provide a brief overview ofsome of the functionality associated therewith. Other systems, methods,functionality, features, and advantages of the claimed subject matterwill be or will become apparent to one with skill in the art uponexamination of the figures and the remaining detailed writtendescription. The above as well as additional objectives, features, andadvantages of the present disclosure will become apparent in thefollowing detailed description.

One motivation for implementation of the segmenting of a videofeed/image and presentation of the virtual action center within a localpreview is the recognition of the unmet need with conventional videoconferencing applications that utilize the camera of a smartphone thatis tethered to a larger display for image capturing. With these videoconferencing applications, the smartphone provides video of the entirelocal scene that is captured by the phones' camera sensors within thetransmitted video feed to the remote participants of the videocommunication session. While tracking of the user can be performed bythe sophisticated devices, if the user is not within arms-length of thephone, the user is unable to control the camera or application settingsunless the user comes closer to the smartphone and uses his/her hands totaps the settings icon on the smartphones local display. Oftentimes, thecameras being utilized in these scenarios are the rear cameras, whichare typically better cameras for video capture, and the devices displayis then not located on the side that is readily accessible to the user.While in active video communication, every action taken by the user toadjust or move the camera is captured within the field of view of therear camera and is transmitted to all of the participants receiving thevideo feed of the local user.

In the following description, specific example embodiments in which thedisclosure may be practiced are described in sufficient detail to enablethose skilled in the art to practice the disclosed embodiments. Forexample, specific details such as specific method orders, structures,elements, and connections have been presented herein. However, it is tobe understood that the specific details presented need not be utilizedto practice embodiments of the present disclosure. It is also to beunderstood that other embodiments may be utilized and that logical,architectural, programmatic, mechanical, electrical and other changesmay be made without departing from the general scope of the disclosure.The following detailed description is, therefore, not to be taken in alimiting sense, and the scope of the present disclosure is defined bythe appended claims and equivalents thereof.

References within the specification to “one embodiment,” “anembodiment,”“embodiments”, or “one or more embodiments” are intended toindicate that a particular feature, structure, or characteristicdescribed in connection with the embodiment is included in at least oneembodiment of the present disclosure. The appearance of such phrases invarious places within the specification are not necessarily allreferring to the same embodiment, nor are separate or alternativeembodiments mutually exclusive of other embodiments. Further, variousfeatures are described which may be exhibited by some embodiments andnot by others. Similarly, various aspects are described which may beaspects for some embodiments but not other embodiments.

The terminology used herein is for the purpose of describing particularembodiments only and is not intended to be limiting of the disclosure.As used herein, the singular forms “a”, “an”, and “the” are intended toinclude the plural forms as well, unless the context clearly indicatesotherwise. It will be further understood that the terms “comprises”and/or “comprising,” when used in this specification, specify thepresence of stated features, integers, steps, operations, elements,and/or components, but do not preclude the presence or addition of oneor more other features, integers, steps, operations, elements,components, and/or groups thereof. Moreover, the use of the terms first,second, etc. do not denote any order or importance, but rather the termsfirst, second, etc. are used to distinguish one element from another.

It is understood that the use of specific component, device and/orparameter names and/or corresponding acronyms thereof, such as those ofthe executing utility, logic, and/or firmware described herein, are forexample only and not meant to imply any limitations on the describedembodiments. The embodiments may thus be described with differentnomenclature and/or terminology utilized to describe the components,devices, parameters, methods and/or functions herein, withoutlimitation. References to any specific protocol or proprietary name indescribing one or more elements, features or concepts of the embodimentsare provided solely as examples of one implementation, and suchreferences do not limit the extension of the claimed embodiments toembodiments in which different element, feature, protocol, or conceptnames are utilized. Thus, each term utilized herein is to be providedits broadest interpretation given the context in which that term isutilized.

Those of ordinary skill in the art will appreciate that the hardwarecomponents and basic configuration depicted in the following figures mayvary. For example, the illustrative components within communicationdevice 110 (FIGS. 2A-2B) are not intended to be exhaustive, but ratherare representative to highlight components that can be utilized toimplement the present disclosure. For example, other devices/componentsmay be used in addition to, or in place of, the hardware depicted. Thedepicted example is not meant to imply architectural or otherlimitations with respect to the presently described embodiments and/orthe general disclosure.

Within the descriptions of the different views of the figures, the useof the same reference numerals and/or symbols in different drawingsindicates similar or identical items, and similar elements can beprovided similar names and reference numerals throughout the figure(s).The specific identifiers/names and reference numerals assigned to theelements are provided solely to aid in the description and are not meantto imply any limitations (structural or functional or otherwise) on thedescribed embodiments.

Referring now to the figures and beginning with FIG. 1 , there isillustrated an example video communication session (VCS) environment 100having a first communication device 110 exchanging a segmented videofeed 128 via the communication session 136 with at least one secondelectronic device 120A-120N, according to one or more embodiments. Firstcommunication device 110 is an electronic device that (i) supports videocommunication with other devices, includes one or more image capturingdevices/sensors (i.e., cameras), and which provides the capability ofbeing utilized as a web camera during the video communication session.In one or more embodiments, communication device 110 is a mobile phoneand/or incorporates the features of a mobile phone. According to one ormore embodiment, communication device 110 can be interchangeablyreferred to as image stream transmitting (IST) device. Communicationdevice 110 is also interchangeably referred to as first communicationdevice 110 to distinguish from second communication devices 140 a-140 n.For simplicity in describing certain features of the disclosure, wheresecond communication devices 140 a-140 n are individually referenced asdifferent communication devices, second communication devices 140 a-140n can be individually presented as second communication device 140 a andthird communication device 140 n.

Returning to FIG. 1 , VCS environment 100 includes a local participant105 who communicatively connects to video communication session 136using mobile communication device 110 that is wired or wirelesslyconnected to display 120. Display 120, which can also be referred to asa display device, is a local display. The displays associated with thesecond communication devices 140 a-140 n are referred to as seconddisplays or remote displays. Display 120 is also external tocommunication device 110 and can thus be referred to as external display120. Communication device 110 is positioned on a desk proximate to localparticipant 105. In the described embodiments, communication device 110is a smartphone held by docking dongle 112 and includes several (i.e.,at least one) image capturing sensors/device (ICD) 115. Docking dongle112 provides a wired USB connection to large display device 120 on whichvideo images captured by ICD 115 and video feeds (129) received fromvideo communication session 136 can be displayed for viewing by localparticipant 105. ICD 115 captures a field of view (FOV) (illustrated bythe dashed lines and generally presented as FOV 117) of local spaceincluding the local participant 105. ICD 115 generates video images thatare presented within a video communication application (VCA) userinterface 122 on display 120 for preview by local participant 105.According to one aspect of the disclosure, communication device 110segments the locally captured video images to provide a primary segment124 that includes the region of interest (ROI), which is typically aface and upper torso of local participant 105. Additionally secondarysegments then provide the remaining content captured within the localscene. According to one aspect of the disclosure, local preview ondisplay 120 can also provide additional virtual control panels 126 a,126b, which occupy or overlay segments of the original video image that arenot within the primary segment.

Communication device 110 transmits segmented video feed 128 comprised ofat least the primary segment through communications network 130, whichincludes wireless communication network 132 and associated physicalnetworking components and wide area network 134 and associated physicalnetworking components. Wide area network 134 provides or supportsconnection by video communication application server(s) 135, which isone or more physical components or logical partitions within serversthat process server-level functions for video communication session 136.Segmented video feed 128 is forwarded by video communication applicationserver(s) 135 to each session-connected second communication device 140a-140 n, which presents segmented video feed 128 on respective remotedisplays 142 a-142 n for viewing by their respective remote participants144 a-144 n.

It is appreciated that the term segmented video feed can mean differentthings within the context of the various different aspects of thedisclosure. The received video feed is a complete video feed thatappears seamless to the remote participants 144 a-144 n, but cancomprise only one of (or less than a full completement of) multiplesegments into which the original video stream captured at the local ICD115 is divided/delineated.

Turning now to FIG. 2A, there is depicted an example communicationdevice 110 within which various aspects of the disclosure can beimplemented, according to one or more embodiments. Examples of suchcommunication devices include, but are not limited to, mobile devices, anotebook computer, a mobile phone, a smart phone, a digital camera withenhanced processing capabilities, a smart watch equipped with an ICD andenhanced processing, a tablet computer, and other types of communicationdevice that incudes or can be directly tethered to an ICD. It isappreciated that communication device 110 can be other types ofelectronic devices that include at least one front facing camera or onerear facing camera and which supports both video and non-videocommunication with one or more second communication devices.

Communication device 110 includes processor 202 (or processor integratedcircuit (IC) chip), which includes processor resources such as centralprocessing unit (CPU) 203 a, communication signal processing resourcessuch as digital signal processor (DSP) 203 b, and graphics processingunit (GPU) 203 c. Processor 202 can, in some embodiments, include highquality camera image signal processors (ISPs) (not shown) and dedicatedartificial intelligence (AI) engines 205. Collectively, processor 202supports computing, classifying, processing, transmitting and receivingof data and information, and presenting of graphical images within adisplay. Processor 202 is communicatively coupled to storage device 204,system memory 220, input devices, introduced below, output devices,including integrated display 230, and image capture device (ICD)controller 234. According to one or more embodiments, ICD controller 234performs or supports functions such as, but not limited to, selectingand activating an active camera from among multiple cameras, adjustingthe camera settings and characteristics (e.g., shutter speed, f/stop,ISO exposure, zoom control, etc.) of the active camera, etc. ICDcontroller 234 can perform these functions in response to commandsreceived from processor 202, which processes instructions of cameracontrol module (296, FIG. 2B) to cause one or more of ICDs 232, 233 tocapture video images of a local scene within a FOV (117) of theoperating ICD. In one or more embodiments, the functionality of ICDcontroller 234 is incorporated within processor 202, eliminating theneed for a separate ICD controller.

For simplicity in describing the features presented herein, the variouscamera selection, activation, and configuration functions performed bythe ICD controller 234 are described as being provided generally byprocessor 202. Similarly, manipulation of captured images and videos aretypically performed by GPU 203 c, and certain aspects of devicecommunication via wireless networks are performed by DSP 203 b withsupport from CPU 203 a. However, for simplicity in describing thefeatures of the disclosure, the functionality provided by one or more ofCPU 203 a, DSP 203 b, and GPU 203 c are collectively described as beingperformed by processor 202.

Throughout the disclosure, the term image capturing device (ICD) isutilized interchangeably to be synonymous with and/or refer to any oneof front or rear facing cameras 232, 233. Front facing cameras (or imagecapture device (ICD)) 232 and rear facing cameras 233 arecommunicatively coupled to ICD controller 234, which is communicativelycoupled to processor 202. Both sets of ICDs 232, 233 includes imagesensors that can capture images that are within the field of view (FOV)of respective ICD 232, 233. Communication device 110 can includesmultiple cameras having different functionality, such as a main cameracapturing standard view, wide angle camera that captures a wide angleFOV, and telephoto ICD, which captures a telephoto FOV (zoom ormagnified). In one or more embodiments, a single camera can be providedwith camera control options to change the single camera lends to allowfor wide angle and telephoto image capture.

System memory 220 may be a combination of volatile and non-volatilememory, such as random access memory (RAM) and read-only memory (ROM).System memory 220 can store program code or similar data associated withfirmware 222, an operating system 224, and/or applications 226. Duringdevice operation, processor 202 loads and executes/processes programcode of the various applications, modules, OS and firmware, that arestored in system memory 220.

In accordance with one or more embodiments, applications 226 include,without limitation, video image segmentation and control module (VISCM)250, video communication session application (VCSA) 252, demonstrationobject preview module (DOPM) 254, and communication module 256. As thenames suggest, each module and/or application (250-256) provides programinstructions that are processed by processor 202 to cause processor 202and/or other components of communication device 110 to perform specificoperations, as described herein. Specifically, VCSA 252 andcommunication module 256 include program instructions that supportscommunication device 110 establishing a communication session with otherexternal devices and systems, such as VCA server 135 and secondcommunication devices 140 a-140 n. VISCM includes program instructionsthe support processor 202 delineating parts of a video feed intosegments that can be separately presented and/or transmitted andinstructions that support processor 202 presenting the touchless actioncenters with virtual interfaces and functionality associated therewith.

In one or more embodiments, communication device 110 includes removablestorage device (RSD) 236, which is inserted into RSD interface 237 thatis communicatively coupled via system interlink to processor 202. In oneor more embodiments, RSD 236 is a non-transitory computer programproduct or computer readable storage device. RSD 236 may have a versionof one or more of the applications (250, 252, 254) stored thereon.Processor 202 can access RSD 236 to provision communication device 110with program code that, when executed/processed by processor 202, theprogram code causes or configures processor 202 and/or generallycommunication device 110, to provide the various different functionalitydescribed herein.

Communication device 110 includes an integrated display 230 whichincorporates a tactile, touch screen interface 231 that can receive usertactile/touch input. As a touch screen device, integrated display 230allows a user to provide input to or to control communication device 110by touching features presented within/below the display screen. Thetactile, touch screen interface 231 can be utilized as an input device.In some implementations, display 230 is integrated into a front surfaceof communication device 110, while the higher quality ICDs are locatedon a rear surface. Communication device 110 is placed in an orientationwith the higher quality ICDs facing the scene being captured andintegrated display located away from the subject of interest.

As one aspect of the disclosure, communication device 110 also includesexternal display 120, which is communicatively coupled to communicationdevice 110 via a physical interface 265 a or a wireless interface 265 b.Display 230 can be one of a wide variety of display screens or devices,such as a liquid crystal display (LCD) and an organic light emittingdiode (OLED) display.

Wireless interface 265 b can be a short-range wireless communicationdevice providing Bluetooth, near field communication (NFC) and/or awireless fidelity (Wi-Fi) connections. In one embodiment, communicationdevice 110 can receive internet or Wi-Fi based calls via wirelessinterface 265 b. In one embodiment, communication device 110 cancommunicate wirelessly with externally provided WiFi router via wirelessinterface 265 b. In an embodiment, WCS 242, antenna(s) 248 and wirelessinterface 265 b collectively provide communication interface(s) ofcommunication device 110. These communication interfaces enablecommunication device 110 to communicatively connect to at least onesecond communication device 140 (FIG. 1 ) via at least one network.

Physical interface 265 a of communication device 110 can serve as a dataport and can be coupled to charging circuitry 235 and device battery 243to enable recharging of device battery 243. Enabling the audiocommunication aspects for video communication session, communicationdevice 110 further includes microphone 208, one or more output devicessuch as speakers 244, and one or more input buttons 207 a-207 n. Inputbuttons 207 a-207 n may provide controls for volume, power, and imagecapture device 232, 233. Microphone 208 can also be referred to as anaudio input device. Microphone 208 and input buttons 207 a-207 n canalso be referred to generally as input devices.

Communication device 110 further includes wireless network communicationsubsystem (WCS) 242, which can represent one or more front end devices(not shown) that each coupled to one or more antennas 248. In one ormore embodiments, WCS 242 can include a communication module with one ormore baseband processors or digital signal processors, one or moremodems, and a radio frequency (RF) front end having one or moretransmitters and one or more receivers. WCS 242 and antennas 248 allowcommunication device 110 to communicate wirelessly with a wirelesscommunication network 132 (FIG. 1 ) via transmissions of communicationsignals to and from network communication devices, such as base stationsor cellular nodes, of wireless communication network 132.

Wireless communication network 132 further allows communication device110 to wirelessly communicate with second communication devices 140a-140 n, which can be similarly connected to wireless communicationnetwork 132. Communication device 110 can also communicate wirelesslywith wireless communication network 132 via communication signalstransmitted by short range communication device(s) (264) to and from anexternal WiFi router, which is communicatively connected to network 132.In one or more embodiment, wireless communication network 132 caninclude one or more servers (e.g., 135) that support exchange ofwireless data and video and other communication between communicationdevice 110 and second communication device 140 a-140 n.

Communication device 110 further includes haptic touch controls 245,vibration device 246, fingerprint/biometric sensor 247, globalpositioning system (GPS) device 260, and motion sensor(s) 262. Vibrationdevice 246 can cause communication device 110 to vibrate or shake whenactivated. Vibration device 246 can be activated during an incoming callor message in order to provide an alert or notification to a user ofcommunication device 110. In one or more embodiments, vibration device246 can be used to inform the user when delineation and separation ofthe segments of a video image occurs and/or when a gesture is detectedand accepted. According to one aspect of the disclosure, integrateddisplay 230, speakers 244, and vibration device 246 can generally andcollectively be referred to as output devices.

Biometric sensor 247 can be used to provide biometric data, such asfingerprints, to identify or authenticate a user. GPS device 260 canprovide time data and location data about the physical location ofcommunication device 110 using geospatial input received from GPSsatellites.

Motion sensor(s) 262 can include one or more accelerometers 263 andgyroscope 264. Motion sensor(s) 262 can detect movement of communicationdevice 110 and provide motion data to processor 202 indicating thespatial orientation and movement of communication device 110.Accelerometers 263 measure linear acceleration of movement ofcommunication device 110 in multiple axes (X, Y and Z). For example,accelerometers 263 can include three accelerometers, where oneaccelerometer measures linear acceleration in the X axis, oneaccelerometer measures linear acceleration in the Y axis, and oneaccelerometer measures linear acceleration in the Z axis. Gyroscope 264measures rotation or angular rotational velocity of communication device110. In one or more embodiments, the measurements of these varioussensors can also be utilized by processor 202 in the determining of thecontext of a communication. Communication device 110 further includeshousing that contains/protects the components internal to communicationdevice 110.

Referring to FIG. 2B, there is shown one embodiment of example contentsof system memory 220 of communication device 110. As generallyintroduced within FIG. 2A, system memory 220 includes data, software,and/or firmware modules, including applications 226, operating system224, firmware 228, and communication module 256. Applications generallyinclude VISCM 250, VCSA 252, DOPM 254, and camera control module (CCM)296.

Each of the various modules and applications includes programinstructions/code that is processed by processor 202 and in someinstances AI engine 205 to configure communication device 110 to performthe various different features of the present disclosure. In one or moreembodiments, several of the modules include a representation of AIengine 205′, which can further include a machine learning (ML) engine(not separately shown) and computer vision (CV) engine (not separatelyshown). AI engine 205′ enables communication device 110 to identify aregion of interest (ROI) within a locally captured video stream, trackone or more subjects within a video stream, identify gesture providedinputs and associate the inputs with appropriate functions providewithin virtual control interfaces, and identify text demonstrationobjects within a video stream, as well as other features and functionspresented herein. In one or more embodiments, execution of the variousmodules by processor 202 enables/configures communication device 110 toperform the method processes presented in the various differentflowcharts, as will be described below.

VISCM 250 includes or utilized a local representation of AI engine 205′which provides the functions of ROI identification 270 and XY(Z) pixelsegmentation 272. As provided within AI engine data 290, AI engine 205′received video image stream 291 and detects specific subjects, such as ahuman, that are to be represented within ROI images 292. Once ROI image292 is detected within video image stream 291, AI engine 205′ determinesa primary segment having primary segment XYZ coordinates 293. Once thecoordinates of the primary segment are identified and the primarysegment delineated, AI engine 205′ identifies secondary segmentcoordinates 294 a-294 n, which further delineate each of the secondarysegments. AI engine 205′provides processor 202 with the required pixelcoordinates for the primary segment and also for the side, secondarysegments that can be utilized for presenting the action centers withvirtual user interface functions. In one embodiment, AI engine 205′ alsomonitors video image stream 291 for images within the physical spaceproximate to the presented locations of the action center(s) on thedisplay device. AI engine data 290 thus includes camera detected handgestures/inputs 295A and gesture identifying/interpreting database 295Bthat can include various different gestures or hand/finger movementsthat correspond to a selection/activation of an action feature/functionwithin the virtual interface. It is appreciated that one or more of thepresented AI engine operations may be performed by other processorcomponents.

According to one feature of the disclosure, in response to detection ofmultiple potential subjects within a received video image, the processoridentifies, via artificial intelligence (i.e., AI engine), a primarysubject from among the multiple potential subjects and aligns theprimary segment with an ROI that incorporates the primary subject.

VISCM 250 also includes action center virtual interface module 274,which supports generation and presentation of action centers within orproximate to the displayed location of one or more of the secondarysegments. In the illustrative example of FIG. 2 , both a left and aright virtual interface 276 a, 276 b are presented, each havingrespective virtual control functions (or selections or control features)278 a-278 c and 279 a-279 c. At least one of the left control functions,e.g., first control function 278 a is linked to and activates selectionof one or more of right control functions, e.g., Pt control function 279a. The interrelation of these control functions will become clearerduring the description of the later figures.

VCSA 252 includes graphical user interface 281 for the videocommunication application in which the video images are presented, videopreview control functions 283, video image settings 284, received videodata 285, remote second participant control sharing UI 286, and receivedsecond participant selections 287. The specific operations and/orfunctions associated with the remote second participant control sharingUI 286 and received second participant selections 287 will be explain ingreater details later. Graphical user interface 281 includes a videopresentation screen, a video preview screen, and a settings screen.

FIG. 3 depicts a settings user interface for a video communicationapplication operating on the communication device and presenting a userselectable option for activating local video segmenting features of anartificial intelligence (AI) engine, in accordance with one or moreembodiments. Specifically, FIG. 3 presents an example settings screen300 having a user interface 305 with a plurality of selectable optionsfor the user to set up the video communications application on his/herdevice. Settings screen 300 includes a video image preview 310 whichpresents local participant 105 in a local scene 315 captured by one ofthe device's ICDs. Included within the selectable options is video imagesegmenting feature 320, which enables a user to activate (i.e., set to“on”) the various functions provided by the different modules introducedabove and described herein. Settings screen 300 also presents integratedcommunication session control options 325 that are visible to the userand selectable during the video communication session.

As illustrated by FIG. 4A, activation of the functions described hereincan also be triggered within a preview user interface. FIG. 4A providesan example illustration of a presentation by electronic device 110 of acapture video image of a local scene within a preview screen (or previewuser interface) 400 of the example video communication application. Thepreview screen 400 is presented on display 120 when video communicationsapplication is activated or based on a user selection within a settingswindow of video communications application. Preview screen presentspreview video image 405, which includes a mirrored view of localparticipant 105 with upper torso and face shown along with right hand410, from which index finger is extended upwards and to the side.Preview screen 400 also presents two side panels 415, 416 withselectable settings components that can be physically selected by mouse,stylus, or keyboard selection. Within right side panel 416, a videoimage segmenting (VIS) function 420 is provided for user selection toactivate (i.e., turn on) the VISCM implemented functions. A bottomcontrol panel 425 is presented with different control options that canbe selected by mouse, stylus, or keyboard selection.

Returning to FIG. 2 , DOPM 254 includes AI engine 205′, which providesthe functions of demonstration object identification and reversemirroring 288 to identify and manipulate demonstration object 289.According to one or more embodiments, and as illustrated and describedherein, demonstration object 289 is a text-based demonstration object.However, it is appreciated that AI engine 205′ can be similarly used toidentify additional objects that may benefit from not being presented inits mirrored image. Additional details on the implementation of DOPM 254will be provided later in the disclosure.

Communication module 256 enables communication device 110 to communicatewith wireless communication network 132 and with other devices, such assecond communication device 140, via one or more of audio, text, andvideo communications. Communication module 256 can support variouscommunication sessions by communication device 110, such as audiocommunication sessions, video communication sessions, text communicationsessions, communication device application communication sessions, or adual/combined audio/text/video communication session.

CCM 296 includes camera parameters and settings 266, and AI engine 205,which further includes subject tracking 298 and camera selection modules299. Other data 200 can also be provided within system memory, and thepresented modules are for example only. Camera parameters and settings266 can include fixed values such as focal length, maximum frame rate,and resolution and variable values such as light level values,directional values, and distances to objects within a camera FOV. Cameraparameters and settings 266 are values and characteristics that canchange during the operation of ICDs 232 and 233 to capture images by thecameras. Camera parameters and settings 266 can include various settingssuch as aperture, shutter speed, iso level, white balance, zoom level,directional settings (i.e., region of interest (ROI)), distancesettings, focus and others. In one embodiment, camera parameters andsettings 266 can be determined by either processor 202 or by ICDcontroller 234 or AI engine 205. In one embodiment, camera parametersand settings 266 can be adjusted by the local participants use ofgestures that are captured by the ICD and associated with one or more ofthe virtual interface functions within the currently displayed actioncenter. Similarly, other characteristics of video images can bedetermined and/or modified, such as light levels within the ROI,contrast, blur, etc., can be adjusted by the local participants use ofgestures that are captured by the ICD and associated with one or more ofthe virtual interface functions within the currently displayed actioncenter.

With the above components integrated within communication device 110, afirst aspect of the disclosure presents a communication device 110having at least one image capturing device (ICD) 233 that captures video(e.g., video image stream 291) of a local scene, an interface 264/265 bywhich the electronic device connects to a local display device 120, anda system memory 220 having stored thereon a video image segmentation andcontrol module (VISCM) 250 comprising program instructions for spatiallydelineating video images into two or more segments that can beindividually presented within or excluded from a video feed. The atleast one processor 202 is communicatively coupled to the local displaydevice 120, to each of the at least one image capturing device 132/133,and to the system memory 220. The at least one processor 202 executes orprocessor the program instructions of the VISCM 250, which causes the atleast one processor 202 to perform a series of processes, which areillustrated by FIGS. 4A-4C and 5A-5B.

According to one aspect of the disclosure, the at least one processor202 divides the video image into different segments in response to userselection of a video image segmenting function 420 before or during thevideo communication session 136. The segmentation feature is triggeredis response to the activation occurring while the communication device110 is communicatively connected to the display device 120 and is beingutilized as a camera for capturing the video images to transmit to thevideo communication session 136. FIGS. 3 and 4B illustrate the VISfeature/function 320/420 set in an activated or on state. The at leastone processor identifies a primary region of interest (ROI) 430 within avideo image 405 captured by the at least one ICD 232/233, the primaryROI 430 being a smaller area than the video image 405. The primary ROI430 is selected via artificial intelligence (e.g., AI engine 205), whichincorporates within the primary ROI 430, a face and upper torso of aperson (105) within a field of view of the at least one ICD 232/233, andexcludes lower arms and hands (e.g., right hand 410) of the person,which are incorporated into one or more other segments. The processor202 delineates the video image 405 into two or more segments, includinga primary segment 440 that encompasses the primary ROI and at least asecondary segment 445A/445B. FIGS. 4B-4C illustrates the segmenting of apreview of captured video images 405 into a plurality of segments,including a primary segment (primary segment 440) encompassing a primaryregion of interest (ROI) 430 and individual, vertical right and leftsecondary segments 445A/445B (FIG. 4B) and horizontal top and bottomsecondary segments 446A/446B (FIG. 4C) of the video image 405. Aspresented in FIG. 4B, each of the left and right vertical secondarysegments 445A, 445B is proximate/adjacent to one of side panels 415/416with selectable settings components associated with one of the devicesettings and/or application settings, according to one or moreembodiments. As provided within FIGS. 4B and 4C and several otherimplementation figures presented herein, the processor is configured todelineate the captured video image into geometric shaped segmentscomprising rectangular segments. The segments comprise at least one of atop segment 446A, a bottom segment 446B, a left segment 445A, and aright segment 445B, arranged relative to the primary segment 440 asviewed in the figure. It is appreciated that the segments presented onthe left and right of the figure appear reversed to the localparticipant.

Referring now to FIGS. 5A-5B and 6A-6C. FIG. 5A illustrates individual,physically separated, vertical segments of the video image, surroundinga central, primary segment that includes the participant's upper torsoand face as the ROI, with at least one vertical segment presenting anaction center within selectable functions/features superimposed over theoriginal side segment to enable selection via air gesturing at a normalspatial position of the participant's hand, according to one or moreembodiments. FIG. 5B illustrates individual, physically separated,vertical segments of the video image, surrounding a central, primarysegment that includes the participant's user's upper torso and face,with at least one vertical segment presenting a virtual whiteboard thatcan be written on via air/finger gesturing, according to one or moreembodiments.

In accordance with a first aspect of the disclosure, the processor 202associates an action center 510A/510B having a virtual interface 515Awith a location of the secondary segment 445A/445B in the video image405. The virtual interface 515A includes at least one displayed featurethat can be selected via one of air gestures and screen touches duringpresentation of the primary segment 440 of the video image 405. As shownby each of the figures, the processor 202 presents a preview with thedelineated segments of the video image on the display device with atleast the primary segment 440 and the one or more other segments (e.g.,secondary segments 445A-445B) of the video image 405 and the virtualinterface of the action center 510A/510B being visible within the localdisplay device 120. In one or more embodiments, the processor 202presents the virtual interface on the display device 120, visiblyadjacent to or integrated as an overlay on top of (i.e., to at leastpartially obscure) the closest secondary segment. As an example, FIG. 5Aprovides a right action center 510B with participant listing settingsuser interface 515A that overlays the left vertical segment (445B) ofthe video image 405. FIG. 5B provides additional left and right actioncenters 510A, 510B respectively overlaying (i.e., replacing or makingobscure/non-visible) the left and right secondary segments (not shown).Left action center 510A incorporates a whiteboard 520 that can be sharedwith other participants to the video communication session. Right actioncenter 510B provides a UI 515B that provides other gesture-selectablefeatures and functions. With whiteboard 520 (FIG. 5B) presentedat/above/adjacent to the location of hand 410 and specifically theextended digit being used as a gesturing tool and writing tool, localparticipant 105 can create images (text, drawings, etc.) on whiteboard520 by gesturing the words or outlining images with the extended digit.AI engine 205 recognizes the gesturing with the right hand 410 andassociates the gesturing movement with writing on the whiteboard 520.The AI engine determines, from camera detected hand gestures/inputs295A, the particular written content being presented by the handmovements, perhaps with reference to the gestureidentifying/interpreting database 295B. The processor 202 then generatesthe corresponding content on the surface of the whiteboard 520. Theprocessor selectively transmits a corresponding action center overlayedwith the virtual whiteboard to enable cross sharing of content placed onthe virtual whiteboard with each second device that receives thecorresponding action center along with the primary segment.

To access the video communication session, the communication device 110includes a communication interface (e.g., WCS 242) that enables thecommunication device 110 to communicatively connect with at least thesecond device 140. The processor is communicatively coupled to thecommunication interface, and the processor enables the communicationdevice 110 to connect, via the communication interface, to the seconddevice 140 via a video communication session 136 enabling exchange ofvideo, audio, and other data with the second device 140. The processorselectively transmits, via the video communication session to the seconddevice, only the content within the primary segment 440 of the videoimage 405 in response to the video image being locally presented as asegmented video image preview. The secondary segment(s) 445 and theassociated virtual interface 515A (FIG. 5A) of the action center 510A(FIG. 5A) are only presented within the preview on the local displaydevice 120.

In one or more embodiments, and as presented by FIG. 4C, the processorprovides a visual indication of which segment is the primary segment 440that is being shared with the second device. In the figure the visualindication is presented by a bold outline of primary segment 440. Othertypes of visual indication, such as color coding or shading or flashingof the primary segment 440, or blurring of the secondary segments 445,etc., can be utilized to inform the local participant of which segmentof the video image 405 is the primary segment that is being transmittedto other participants on the video communication session.

FIGS. 6A-6C and 7A-7C presents pairing of the video image presented by alocal communication device display and on a remote participant's displayfollowing segmentation of the video image and local participantselection of background features from the action center, according toone or more embodiments. With specific reference to FIGS. 6A-6C, theprocessor is further configured to detect, within a captured videoimage, one or more air gestures spatially proximate in preview screen toa visual on-screen location of a specific feature among the at least onefeature within the virtual interface. The one or more air gesturesrepresent a local participant selection of the specific feature orfunction. The at least one feature includes at least one of imagecapturing device configuration and settings, video image presentationsettings, local device settings, and application settings for the videoconference application that enables the video communication session withat least one second device. In response to identifying the selection ofthe specific feature, the processor enables a corresponding functionassociated with the specific feature. The enabling of the function canoccur concurrently with a capturing and transmission of at least theprimary segment of the video image to the video communication session.

In one or more embodiments, the processor 202 is further configured toidentify selection of the specific feature, which activates a functionthat modifies one or more characteristics of at least the content withinthe primary segment of the video image. The processor applies thefunction to an original content within at least the primary segment 440to generate a resulting modified content, and the processor locallypresents and remotely transmits the resulting modified content withinthe primary segment 440 in place of original content.

With reference to the illustrative examples of FIGS. 6A-6C and referenceto paired illustrations of FIGS. 7A-7C, webcam local preview 600Aprovides user interface 515C of left action center 510A withgesture-selectable background image features, including backgroundeffect 605 and blur effect 610. In FIG. 6A, webcam local preview 600Apresents primary segment 440 before any background features are applied,and the primary segment 440 is shared with its captured backgroundwithin receiving device video feed 700A. Receiving device video feed700A is displayed on the remote display device 710A of a second device140 participating in the video communication session. Right userinterface does not currently have any selected options; However,background effect 605 has been selected, as indicated by the darkenedhue, by participant gesturing or hovering her right finger over thespatial area beneath which the virtual user interface is presented inthe video image.

According to one aspect, because the preview is displaying a mirrorimage of the local participant, the participant selects items on theleft action center with his right hand, and vice versa. The right handappears adjacent to and/or within the same space as the left actioncenter with preview image mirroring. In instances where no mirroringoccurs in the preview image, the correct right or left hand andcorresponding right or left action centers are aligned. In an alternateembodiment the selection can be is assumed to be based on the user'sview of the mirrored image, and the selection can be further enhanced byallowing an icon to track the user's gestures on the preview screen sothat the user can visibly see which action center the user isinterfacing with. In one or more embodiments, the selection of rightversus left placement of the action center can be a user preference,where the user can selectively assign the location of specific actioncenters within the settings menu of the video communication application.A left-handed user can thus reverse the relative placement of the actioncenters. Also, a user can move the action centers with the most commonlyutilized features to within the best virtual location for capturing theusers gesture inputs.

Referring back to the figures, FIG. 6B illustrates second view of webcamlocal preview 600B in which the left action center 510A (right sideaction center from the viewpoint of local participant 105) is nowpopulated with selectable features for background effect 605. Thesefeatures are made visible within left action center 510A in response toselection of background effect 605 within right action center 510B (leftside from the viewpoint of local participant 105). Followingparticipants selection (by gesturing) of a particular background 620A,processor 202 applies the selected background 625A to the video image.The primary segment of the video image is transmitted with the selectedbackground 625A via video communication session to second device anddisplayed as receiving device video feed 700B with selected background625A. Similarly, local participant selection of a next background 620Bby air gestural inputs causes processor 202 to change the background tonext selected background 625B presented in third view of webcam localpreview 600C. Primary segment 440 of video image 405 is then transmittedto second device with next selected background 625B applied to receivingdevice video feed 700C.

FIG. 8 illustrates preview video image with a different set of exampleimage control features within the virtual interface of the actioncenters of webcam local preview 800. Right action center 510B provides avirtual user interface that includes subject tracking 805. Left actioncenter 510A includes a sliding bar selection 810 for setting thesensitivity of subject tracking. Each action center 510A/510B andcorresponding selectable objects within respective virtual userinterface 815, 820 overlay the secondary segments (445A-445B) of thelocally captured video image 405.

FIG. 9 illustrates webcam local preview 900 providing examples ofseveral features and sub-features for applying changes in the videoimage background effect to only the primary segment that is beingtransmitted to the video communication session, according to one or moreembodiments. As illustrated, selection of the particular background 620from left action center 510A applies a corresponding new/differentbackground to the primary segment 440. Secondary segments 445 retaintheir normal background captured with the video image. As an extensionof this feature, the application of the background effect can begranularly applied to the preview of each individual segment that isuniquely selected to be transmitted to a second participant. Asillustrated within right action center 510B, virtual user interface mayalso provide a granular series of each delineated segment with an optionto select that segment independent of the other segments available. Asshown, in addition to selection of background effect feature 905,primary segment background effect option 910 is selected, as indicatedby the X inside the selection box. This limits the background effect tobeing applied to only the primary segment 440. Additional selectionboxes are provided for each of the other delineated secondary segments,none of which are shown selected. If local participant 105 checks theselection box associated with one or both of the two secondary segments445, then the particular background 620 is applied to the selectedsecondary segment(s) as well. This feature enables the background effectto be applied to the primary segment 440 and a selected one or moresecondary segment 445 that is also selected to be transmitted over thevideo communication session to the second device(s).

In one or more embodiments, selection of the secondary segments occursindependent of the application of a background effect. These embodimentsenable the local participant to granularly/selectively choose which ofthe segments to transmit over the video communication session. Rightaction center 510B is shown having additional selectable features,including ROI zoom 915 and ROI XY Positioning 920, which enables a localparticipant (or a remote authorized second participant) to adjust a sizeand location of the primary segment 440. Once delineated on the userdevice, primary segment can thus be expanded, made smaller, shifted leftor right, up or down, extended horizontally or vertically, etc.

FIGS. 10A-10B (collectively FIG. 10 ) depict a flowchart of a method bywhich an communication device performs segmenting of a locally capturedvideo image presented within a preview of the video image thatincorporates an action center with a virtual user interface for receiptof user selections via air gestures, according to one or moreembodiments. The descriptions of the various methods presented hereinare provided with general reference to the specific componentsillustrated within the preceding figures. Specific components referencedin method 1000 and other methods described herein may be identical orsimilar to components of the same name used in describing the precedingfigures. In one or more embodiments, the features of method 1000 and theother methods described herein can be provided by processor 202 ofcommunication device 110 operating on program instructions from thevarious modules and applications presented in FIGS. 2A-2B. It isappreciated that certain of the described features of method 1000 andthe other presented methods may be completed by AI engine 205, ICDcontroller 234, GPU 203 c, etc., and these features are all generallydescribed as being performed by processor 202 for simplicity.

With reference to FIG. 10A, method 1000 includes initiating a videocommunication session application and establishing a video communicationsession with at least one second device, while the communication deviceis configured to be used as a webcam and is presenting video imagepreview within the applications user interface via a connected displaydevice (block 1002). Method 1000 includes receiving, by a processor ofthe communication device from an active/connected image capturingdevice, a video stream comprising video images within sequential videoframes (block 1004). Method 1000 includes identifying, by artificialintelligence (AI) processing, a primary region of interest (ROI) withina video image, the primary region of interest being a smaller area thanthe video image (block 1006). Method 1000 includes delineating the videoimage into two or more segments, including a primary segment thatencompasses the primary ROI and at least a secondary segment (block1008). According to one or more embodiments, dividing the video imageinto different segments is performed in response to user selection of avideo segmentation function at least before or during the videocommunication session, while the electronic device is communicativelyconnected to the display device and is being utilized as a camera forcapturing the video images to transmit to the video communicationsession. Additionally, delineating the video image with a primarysegment that encompasses the primary ROI includes incorporating withinthe primary segment, a face and upper torso of a person within a fieldof view of the at least one image capturing device, and incorporatinglower arms and hands of the person into one or more other segments.

Method 1000 includes associating a virtual interface with a location ofthe secondary segment in the video image, the virtual interfaceincluding at least one feature that can be selected via one of airgestures and screen touches during presentation of the primary segmentof the video image (block 1010). Method 1000 includes presenting apreview of the delineated segments of the video image on the displaydevice with at least the primary segment and the secondary segment ofthe video image and the virtual interface being visible within thedisplay device (block 1012).

In presenting the preview, method 1000 includes presenting, on thedisplay device, a preview comprising the primary segment and the one ormore other segments of the video image as delineated segments from eachother and presenting the virtual interface on the display device,visibly adjacent to or integrated as an overlay on top of the secondarysegment. In one or more embodiments, method 1000 can also includeproviding a visual indication of which segment is the primary segmentthat is being shared with the second device.

From block 1012, method 1000 transitions to decision block 1014 in FIG.10B. At decision block 1014, method includes detecting, within acaptured video image, one or more air gestures proximate to a locationof a specific feature among the at least one feature within the virtualinterface, the one or more air gestures representing a selection of thespecific feature. The at least one feature includes at least one ofimage capturing device configuration and settings, video imagepresentation settings, local device settings, and application settingsfor a video conference application enabling a video communicationsession with at least the second device. Method 1000 includes inresponse to identifying the selection of the specific feature, enablinga corresponding function associated with the specific featureconcurrently with a capturing and transmission of at least the primarysegment of the video image (block 1016). Method 1000 includesidentifying selection of the specific feature, which activates afunction that modifies one or more characteristics of at least thecontent within the primary segment of the video image (block 1018).Method 1000 includes applying the function to the content within atleast the primary segment to generate a resulting modified content(block 1020), and locally presenting and remotely transmitting theresulting modified content within the primary segment in place oforiginal content (block 1022). Method then ends.

With ongoing reference to FIGS. 1-9 , according to one aspect of thedisclosure, the program instructions of the VISCM cause the at least oneprocessor to associate control features of the virtual interface with atleast one selected second device from among the plurality of seconddevices connected to the video communication session. The processortransmits, to each of the plurality of second devices, at least theprimary segment for displaying at a corresponding second display of asecond device during the video communication session, and the processorconcurrently transmit, to each of the one or more selected seconddevice, the control features of the virtual interface associated withthe at least one selected second device to trigger the selected seconddevice to present the virtual interface on the corresponding seconddisplay, concurrently with presenting the primary segment as parts of acombined video stream from the electronic device. Each of the pluralityof second devices can be transmitted a different combination ofsegmented video images comprising at least the primary segment with zeroor more virtual interfaces remotely presented on corresponding seconddisplays for selection at a corresponding selected second device. As theoriginator of the video feed, the communication device 110 is referredto as an image stream transmitting (IST) device.

With reference now to FIG. 11 , there is illustrated an exampleselection table that is generated and presented on a display device 120of a local participant 105 to enable selection of which remote, secondparticipant (140 a-140 n) on the video communication session 136 isprovided access to the action center features that control/manipulatecharacteristics of the primary segment on the local display device,which characteristics are locally presented and/or presented within thevideo feed to other remote, second participants, according to one ormore embodiments. In one or more embodiments, to associate the virtualinterface with the at least one selected second device from among theplurality of second devices, the at least one processor 202 presents aconference participant permission settings window (PPSW) 1100 (see alsoUI 515A, FIG. 5A) that receives a selection ranging from between none toone or more virtual interfaces to provide to a particular second device140 identified via an associated participant name (e.g., first column oftable). In response to receiving the selections by local participant,processor 202 associates a selected one or more virtual interfaces withthe particular second device. As shown within PPSW 1100, localparticipant is set to receive by default a preview of all of thesegments and action centers with virtual UIs. Participants 2 and 6 areassigned only the primary segment. Participant 3 is identified as a hostor cohost and provide action center control features. Participant 3 thusreceives both primary segment and the action center virtual UI.Participant 4 is not provided host control but is provided with accessto action center and thus also receives both the primary segment and theaction center virtual UI. Participant 5 receives the full, unedited orunsegmented video feed.

From the above table example, each of the plurality of second devicesthat have not been assigned to receive an action center virtualinterface, e.g., Participant 2 and 6, only receives the primary segmentin a received video feed from the IST device. PPSW 1100 also includeshost control inset 1110 and video control inset 1115, which respectivelypresent a listing/indication of which participants have been given hostcontrol (with capability to modify the local video image for allparticipants), and a listing/indication of which participants have beengiven local video control (with capability to modify the local videoimage for just that participant and perhaps on the IST device(communication device 110).

FIG. 12 illustrates components of a video communication environment inwhich a receiving second device receives a video feed from and imagestream transmitting (IST) device along with virtual action centers thatenable remote modification of the image feed, according to one or moreembodiments. Referring now to FIG. 12 , there is illustrated a videocommunication environment 1200 in which a video communicationapplication server 135 supports/enables a video communication session136 that involves communicating a video feed 1205 from image streamtransmitting (IST) device 110 to a receiving second device 140. The termIST device 110 is utilized to refer to the specific device that istransmitting an image stream with video image s that are being viewed onthe receiving second device 140. In the illustrative embodiments, ISTdevice 110 is communication device 110 presented within the precedingfigures. It is appreciated, however, that the term IST device can beapplied to any one of the devices that are communicatively coupled tovideo communication session 136 and transmits a video image, which isshared with other devices connected to the video communication session.Importantly, both IST device 110 and second device 140 are configuredwith program instructions to enable the unique functions describedherein.

According to the illustrative embodiment, IST device 110 presentspreview image 1240 that is shared with second device 140 and ispresented within second device display as received video images 1240′.IST device 110 also provides to at least two other second devices,identified as participant 3 1220A (provided with host-level video imagemodification privileges) and participant 4 1220B (provided with limited(or individual) video image modification privileges) the featuresprovided by one or more action centers 510A-510B. Received virtualaction centers 510A′, 510B′ are presented on the corresponding displaydevice of the second device along with the received video image 1240′.By making selections and interfacing with the provided features of theprovided action centers, the receiving second device 140 a is able toremotely modify the video image 1240 at IST device 110 and on theircorresponding local displays. Participant 3 with host privilege canfurther modify video image 1240 received by other second devices.

As shown by the participants permission settings window (PPSW) 1100 andalso within video preview screen of IST device 110 each second deviceprovided with action center functions are indicated via associatedidentifiers (1220 a-1220 b) on the local display. These identifiers canbe provided within display participant listing feature (UI 515A, FIG.5A), in one embodiment. The at least one processor presents, on thedisplay device, a local preview window 1210 comprising a visualindication (e.g., 1100, 1220 a-1220 b, or 515A) of which of the one ormore virtual interfaces are associated for transmission to each of theplurality of second devices designated to receive at least one virtualinterface within a transmitted video stream. In one or more embodiments,as indicated by FIG. 12 , the visual indication may persist on screen,highlighting the video image or name of the selected participants duringthe video communication session.

In one or more embodiments, the at least one processor detects selectionof the particular second device 140 as an administrative device (e.g., adevice having host-level privileges), auto-selects specific virtualinterfaces having functions controllable by the administrative device,and transmits the auto-selected virtual interfaces concurrently with theprimary segment to enable the virtual interfaces to be visuallypresented, concurrently with the primary segment, within the seconddisplay of the particular second device.

In one or more embodiments, the processor further enables the device tomonitor for receipt of an indication of a remote selection or activationof at least one control function from within the virtual interfaces atthe particular second device. In response to receiving an indication ofa remote selection that is associated with a modification of acharacteristic of the video images within the primary segment, theprocessor applies the modification to video image content within theprimary segment at the electronic device and transmits the modificationof the video image content to each of the plurality of second devicesparticipating in the video communication session. In one embodiment, thebroadcasting of the modified video image content is triggered only whenthe participant has host control, such as Participant 3 in PPSW 1100.Participant 4 is capable of also modifying the video image, but themodification only affects the primary segment content presented withinthe IST device display and/or the remote display of Participant 4.

FIG. 13 provides another example video communication environment 1300 inwhich a receiving second device remotely modifies the content and/orcharacteristics of at least the primary segment of a video image that isreceived from the IST device 110. The components of FIG. 13 are similarto those of FIG. 12 . However, in the present figure, participant/userof receiving second device 140 a selects by gesturing or other inputmethod a particular background effect 1320 from the received virtualaction centers 510A′ to apply to the received video image 1240′ (FIG. 12). The selection at receiving second device 140 a of the particularbackground effect 1320 is transmitted via the video communicationsession 136 as a background modification request 1330 to the IST device110. The IST device 110 responds to receipt of the backgroundmodification request 1330 by applying the modification to the localvideo image 1240, generating modified video image 1340. Modified videoimage 1340 is then transmitted via video feed 1345 to each second device140 a-140 n, which displays local versions of modified video image1340′. According to the illustrative embodiment, Participant 3 1220A′ isvisually identified within IST device display as the source of themodification to video image 1340.

FIG. 14 (14A-14B) depicts a flowchart of a method by which acommunication device selectively transmits different combinations ofprimary segments of a segmented local video image and one or morevirtual interfaces of action centers associated with modifying one ormore characteristics of the primary segment and generating visiblefeedback of which second devices received the enhanced action centerfunctions, according to one or more embodiments. Method 1400 isdescribed with reference to the preceding figures. Beginning at FIG.14A, method 1400 includes receiving, by processor 202 from anactive/connected image capturing device, a video stream comprising videoimages within sequential video frames (block 1402). Method 1400 includesidentifying, by artificial intelligence (AI) processing, a primaryregion of interest (ROI) within a video image, the primary region ofinterest being a smaller area than the video image (block 1404). Method1400 includes delineating the video image into two or more segments,including a primary segment that encompasses the primary ROI and atleast a secondary segment (block 1406). Method 1400 includes associatinga virtual interface with a location of the secondary segment in thevideo image, the virtual interface presenting at least one feature thatcan be selected via one of air gestures and screen touches duringpresentation of the primary segment of the video image (block 1408).Method 1400 includes associating the virtual interface with one or moreselected second device from among the plurality of second devices (block1410). According to one or more embodiments and as illustrated withinmethod 1400, associating the virtual interface with the one or moreselected second device from among the plurality of second devicesincludes presenting a participant settings interface (e.g., PPSW 1100)that receives a selection ranging from between none to one or morevirtual interfaces to provide to a particular second device identifiedvia an associated participant name (block 1412) and in response toreceiving the selection, associating a selected one or more virtualinterfaces with the particular second device (block 1414).

Method 1400 includes transmitting, to each of the plurality of seconddevices, at least the primary segment 440 for displaying at acorresponding second display of a second device during the videocommunication session (block 1416). Method 1400 then transitions toblock 1418 in FIG. 14B. At block 1418, method 1400 includes concurrentlytransmitting, to each of the one or more selected second device, thevirtual interface associated with the selected second device to triggerthe selected second device to present the virtual interface on thecorresponding second display, concurrently with presenting the primarysegment as parts of a combined video stream from the electronic device.Accordingly, each of the plurality of second devices can be transmitteda different combination of segmented video images including at least theprimary segment with zero or more virtual interfaces remotely presentedon corresponding second displays for selection at a correspondingselected second device. Method then includes presenting, on the displaydevice, a local preview window comprising a visual indication of whichof the one or more virtual interfaces are associated for transmission toeach of the plurality of second devices designated to receive at leastvirtual interface within a transmitted video stream (block 1420). In oneor more implementations of the features of blocks 1410-1418, method 1400can include detecting selection of the particular second device as anadministrative device, auto-selecting specific virtual interfaces havingfunctions controllable by the selected administrative device, andtransmitting the auto-selected virtual interfaces concurrently with theprimary segment to enable virtual interfaces to be visually presentedwithin the second display of the particular second device concurrentlywith the primary segment.

Returning to FIG. 14B, method 1400 includes monitoring for receipt of anindication of a remote selection or activation of at least one controlfunction from within the virtual interfaces at the particular seconddevice (block 1422). Method 1400 further includes, in response toreceiving an indication of a remote selection that is associated with amodification of a characteristic of the video images within the primarysegment, applying the modification to video image content within theprimary segment at the electronic device (block 1424) and transmittingthe modification of the video image content to each of the plurality ofsecond devices participating in the video communication session (block1426). Method 1400 then ends.

According to another aspect of the disclosure, and is introduced withinthe description of FIG. 13 , a second (communication) device 140 thatreceives a video stream from an image stream transmitting (IST) device110 is provided with functionality that enables the receiving secondcommunication device 140 to modify a received image stream based onpermissions assigned to the second communication device 140) by the ISTdevice 110. According to one or more embodiments, second devices 140a-140 n can be similarly configured to perform the aforementionedfeatures of video image segmenting, action center generation andpresentation, transmission of primary segments along withparticipant-assigned action center features, which were described asbeing performed by communication device 110. Communication device 110can thus serve as the receiving second device that receives a video feedfrom another one of the second devices that serves as the IST device.The device receiving the video feed also receives the one or more actioncenters with virtual user interface objects that can be selected tomanipulate the video feed. Second communication device 140 includes atleast one image capturing device that captures video of a local scene, adisplay device, and a communication interface that enables thecommunication device 110 to communicatively connect with and receivevideo data from an IST device 110 during a video communication session.Second communication device 140 also includes a memory having storedthereon a segmented video image control module (SVICM) enablingpresentation and control of a received segmented video feed including aprimary segment and at least one secondary segment presenting a virtualcontrol interface (e.g., an action center 510). Second communicationdevice 140 includes at least one processor communicatively coupled tothe display device, the communication interface, each of the at leastone image capturing device, and to the memory. The at least oneprocessor processes program code of the SVICM, which enables theprocessor to receive the segmented video stream with the primary segmentpresenting a region of interest captured at the IST device and the atleast one secondary segment, each presenting a respective virtualcontrol interface. The at least one processor presents, on the displaydevice, a control enabling view of the segmented video stream comprisingboth the primary segment and the at least one secondary segment with therespective virtual control interface, each virtual control interfacefurther presenting at least one selectable feature that can be selectedby an input from among a screen touch and an air gesture during localpresentation of the received segmented video stream. The processormonitors for movement captured by the at least one image capturingdevice that is indicative of the input occurring within a spatiallocation proximate to the selectable feature within a correspondingsecondary segment. In response to detecting the input, the processoridentifies a function associated with the selectable feature andactivates the function to affect one or more characteristics of a videoimage presented within the primary segment. The selectable featuremodifies at least one characteristic of content visibly presented withinthe primary segment, and the processor modifies the characteristic ofthe content in response to the input.

In one embodiment, the processor modifies the characteristic of thecontent at the IST device by transmitting a corresponding contentsettings update indication to the IST device via the video communicationsession. Also, the processor modifies the characteristic of the contentfor at least one of: the electronic device only; and all connecteddevices participating in the video communication session that receivethe primary segment from the IST device. The function modifies one ormore of: (i) characteristics of the content within the primary segment,(ii) characteristics of the primary segment, (iii) characteristics ofthe video communication session, (iv) a local device setting, and (v) asetting of the IST device.

In one embodiment, one of the secondary segments includes a virtualwhite board and the input activates the white board for content sharingbetween the electronic device and at least the IST device. In one ormore embodiments, the received segmented video stream comprises apreview window identifying which segments from among the primary segmentand the at least one secondary segment are received by each seconddevice connected to the video communication session that receives videocontent from the IST device.

FIGS. 15 depicts a flowchart of a method 1500 by which an electronicdevice receives, via a video communication session, a video feed with asegmented video image from an IST device and performs control operationson one or more characteristics of the video image at the IST deviceusing a received and displayed virtual interface of an action centerassociated with the received video image, according to one or moreembodiments. Method 1500 includes receiving, by a processor via acommunication interface of an electronic device, a segmented videostream from an image stream transmitting (IST) device during a videocommunication session, the segmented video stream comprising a primarysegment presenting a region of interest captured at the IST device andat least one secondary segment, each secondary segment presenting arespective virtual control interface (block 1502). Method 1500 includespresenting, on a display device, a control view of the segmented videostream comprising both the primary segment and the at least onesecondary segment with the respective virtual control interface, eachvirtual control interface further presenting at least one selectablefeature that can be selected via an input from among a screen touch andan air gesture during local presentation of the received segmented videostream (block 1504). Method 1500 includes monitoring for movement,captured by at least one image capturing device, that is indicative ofthe input occurring within a spatial location proximate to theselectable feature within the respective secondary segment (block 1506).Method 1500 includes in response to detecting the input, identifying afunction associated with the selectable feature and activating thefunction to affect one or more characteristics of a video imagepresented within the primary segment (block 1508). The selectablefeature modifies at least one characteristic of content visiblypresented within the primary segment. Method 1500 includes modifying thecharacteristic of the content at the IST device by transmitting acorresponding content settings update indication to the IST device viathe video communication session (block 1510).

In the above-described methods, one or more of the method processes maybe embodied in a computer readable device containing computer readablecode such that operations are performed when the computer readable codeis executed on a computing device. In some implementations, certainoperations of the methods may be combined, performed simultaneously, ina different order, or omitted, without deviating from the scope of thedisclosure. Further, additional operations may be performed, includingoperations described in other methods. Thus, while the method operationsare described and illustrated in a particular sequence, use of aspecific sequence or operations is not meant to imply any limitations onthe disclosure. Changes may be made with regards to the sequence ofoperations without departing from the spirit or scope of the presentdisclosure. Use of a particular sequence is therefore, not to be takenin a limiting sense, and the scope of the present disclosure is definedonly by the appended claims.

Aspects of the present disclosure are described above with reference toflowchart illustrations and/or block diagrams of methods, apparatus(systems) and computer program products according to embodiments of thedisclosure. It will be understood that each block of the flowchartillustrations and/or block diagrams, and combinations of blocks in theflowchart illustrations and/or block diagrams, can be implemented bycomputer program instructions. Computer program code for carrying outoperations for aspects of the present disclosure may be written in anycombination of one or more programming languages, including anobject-oriented programming language, without limitation. These computerprogram instructions may be provided to a processor of a general-purposecomputer, special-purpose computer, or other programmable dataprocessing apparatus to produce a machine that performs the method forimplementing the functions/acts specified in the flowchart and/or blockdiagram block or blocks. The methods are implemented when theinstructions are executed via the processor of the computer or otherprogrammable data processing apparatus.

As will be further appreciated, the processes in embodiments of thepresent disclosure may be implemented using any combination of software,firmware, or hardware. Accordingly, aspects of the present disclosuremay take the form of an entirely hardware embodiment or an embodimentcombining software (including firmware, resident software, micro-code,etc.) and hardware aspects that may all generally be referred to hereinas a “circuit,” “module,” or “system.”Furthermore, aspects of thepresent disclosure may take the form of a computer program productembodied in one or more computer readable storage device(s) havingcomputer readable program code embodied thereon. Any combination of oneor more computer readable storage device(s) may be utilized. Thecomputer readable storage device may be, for example, but not limitedto, an electronic, magnetic, optical, electromagnetic, infrared, orsemiconductor system, apparatus, or device, or any suitable combinationof the foregoing. More specific examples (a non-exhaustive list) of thecomputer readable storage device can include the following: a portablecomputer diskette, a hard disk, a random access memory (RAM), aread-only memory (ROM), an erasable programmable read-only memory (EPROMor Flash memory), a portable compact disc read-only memory (CD-ROM), anoptical storage device, a magnetic storage device, or any suitablecombination of the foregoing. In the context of this document, acomputer readable storage device may be any tangible medium that cancontain or store a program for use by or in connection with aninstruction execution system, apparatus, or device.

Where utilized herein, the terms “tangible” and “non-transitory” areintended to describe a computer-readable storage medium (or “memory”)excluding propagating electromagnetic signals, but are not intended tootherwise limit the type of physical computer-readable storage devicethat is encompassed by the phrase “computer-readable medium” or memory.For instance, the terms “non-transitory computer readable medium” or“tangible memory” are intended to encompass types of storage devicesthat do not necessarily store information permanently, including, forexample, RAM. Program instructions and data stored on a tangiblecomputer- accessible storage medium in non-transitory form mayafterwards be transmitted by transmission media or signals such aselectrical, electromagnetic, or digital signals, which may be conveyedvia a communication medium such as a network and/or a wireless link.

The description of the present disclosure has been presented forpurposes of illustration and description, but is not intended to beexhaustive or limited to the disclosure in the form disclosed. Manymodifications and variations will be apparent to those of ordinary skillin the art without departing from the scope of the disclosure. Thedescribed embodiments were chosen and described in order to best explainthe principles of the disclosure and the practical application, and toenable others of ordinary skill in the art to understand the disclosurefor various embodiments with various modifications as are suited to theparticular use contemplated.

As used herein, the term “or” is inclusive unless otherwise explicitlynoted. Thus, the phrase “at least one of A, B, or C” is satisfied by anyelement from the set {A, B, C} or any combination thereof, includingmultiples of any element.

While the disclosure has been described with reference to exampleembodiments, it will be understood by those skilled in the art thatvarious changes may be made and equivalents may be substituted forelements thereof without departing from the scope of the disclosure. Inaddition, many modifications may be made to adapt a particular system,device, or component thereof to the teachings of the disclosure withoutdeparting from the scope thereof. Therefore, it is intended that thedisclosure not be limited to the particular embodiments disclosed forcarrying out this disclosure, but that the disclosure will include allembodiments falling within the scope of the appended claims.

What is claimed is:
 1. An electronic device comprising: at least oneimage capturing device that captures video of a local scene; a displaydevice; a communication interface that enables the electronic device tocommunicatively connect with and receive video data from an image streamtransmitting (IST) device during a video communication session; a memoryhaving stored thereon a segmented video image control module (SVICM)enabling presentation and control of a received segmented video streamcomprising a primary segment and at least one secondary segmentpresenting a virtual control interface; and at least one processorcommunicatively coupled to the display device, the communicationinterface, each of the at least one image capturing device, and to thememory, the at least one processor executing program code of the SVICM,which enables the processor to: receive the segmented video stream withthe primary segment presenting a region of interest captured at the ISTdevice and the at least one secondary segment, each presenting arespective virtual control interface; present, on the display device, acontrol enabling view of the segmented video stream comprising both theprimary segment and the at least one secondary segment with therespective virtual control interface, each virtual control interfacefurther presenting at least one selectable feature that can be selectedby an input from among a screen touch and an air gesture during localpresentation of the received segmented video stream; monitor formovement captured by the at least one image capturing device that isindicative of the input occurring within a spatial location proximate tothe selectable feature within a corresponding secondary segment; and inresponse to detecting the input, identify a function associated with theselectable feature and activate the function to affect one or morecharacteristics of a video image presented within the primary segment.2. The electronic device of claim 1, wherein the selectable featuremodifies at least one characteristic of content visibly presented withinthe primary segment, and the processor modifies the characteristic ofthe content in response to the input.
 3. The electronic device of claim2, wherein the processor modifies the characteristic of the content atthe IST device by transmitting a corresponding content settings updateindication to the IST device via the video communication session.
 4. Theelectronic device of claim 3, wherein the processor modifies thecharacteristic of the content for at least one of: the electronic deviceonly; and all connected devices participating in the video communicationsession that receive the primary segment from the IST device.
 5. Theelectronic device of claim 1, wherein the function modifies one or moreof: (i) characteristics of content within the primary segment, (ii)characteristics of the primary segment, (iii) characteristics of thevideo communication session, (iv) a local device setting, and (v) asetting of the IST device.
 6. The electronic device of claim 1, whereinone of the secondary segments comprises a virtual white board and theinput activates the white board for content sharing between theelectronic device and at least the IST device.
 7. The electronic deviceof claim 1, wherein the received segmented video stream comprises apreview window identifying which segments from among the primary segmentand the at least one secondary segment are received by each seconddevice connected to the video communication session that receives videocontent from the IST device.
 8. A method comprising: receiving, by aprocessor via a communication interface of an electronic device, asegmented video stream from an image stream transmitting (IST) deviceduring a video communication session, the segmented video streamcomprising a primary segment presenting a region of interest captured atthe IST device and at least one secondary segment, each secondarysegment presenting a respective virtual control interface; presenting,on a display device, a control enabling view of the segmented videostream comprising both the primary segment and the at least onesecondary segment with the respective virtual control interface, eachvirtual control interface further presenting at least one selectablefeature that can be selected via an input from among a screen touch andan air gesture during local presentation of the received segmented videostream; monitoring for movement, captured by at least one imagecapturing device, that is indicative of the input occurring within aspatial location proximate to the selectable feature within therespective secondary segment; and in response to detecting the input,identifying a function associated with the selectable feature andactivating the function to affect one or more characteristics of a videoimage presented within the primary segment.
 9. The method of claim 8,where the selectable feature modifies at least one characteristic ofcontent visibly presented within the primary segment, and the methodcomprises modifying the characteristic of the content in response to theinput.
 10. The method of claim 9, wherein the processor modifies thecharacteristic of the content at the IST device by transmitting acorresponding content settings update indication to the IST device viathe video communication session.
 11. The method of claim 10, wherein theprocessor modifies the characteristic of the content for at least oneof: the electronic device only; and all connected devices participatingin the video communication session that receive the primary segment fromthe IST device.
 12. The method of claim 8, wherein the function modifiesone or more of: (i) characteristics of content within the primarysegment, (ii) characteristics of the primary segment, (iii)characteristics of the video communication session, (iv) a local devicesetting, and (v) a setting of the IST device.
 13. The method of claim 8,wherein one of the secondary segments comprises a virtual white boardand the input activates the white board for content sharing between theelectronic device and at least the IST device.
 14. The method of claim8, wherein the received segmented video stream comprises a previewwindow identifying which segments from among the primary segment and theat least one secondary segment are received by each second deviceconnected to the video communication session that receives video contentfrom the IST device.
 15. A computer program product comprising anon-transitory computer readable medium having program instructions thatwhen executed by a processor of an electronic device connected to adisplay device enables the electronic device to perform the functionsof: receiving, by a processor via a communication interface of anelectronic device, a segmented video stream from an image streamtransmitting (IST) device during a video communication session, thesegmented video stream comprising a primary segment presenting a regionof interest captured at the IST device and at least one secondarysegment, each secondary segment presenting a respective virtual controlinterface; presenting , on a display device, a control enabling view ofthe segmented video stream comprising both the primary segment and theat least one secondary segment with the respective virtual controlinterface, each virtual control interface further presenting at leastone selectable feature that can be selected via an input from among ascreen touch and an air gesture during local presentation of thereceived segmented video stream; monitoring for movement, captured by atleast one image capturing device, that is indicative of the inputoccurring within a spatial location proximate to the selectable featurewithin the respective secondary segment; and in response to detectingthe input, identifying a function associated with the selectable featureand activating the function to affect one or more characteristics of avideo image presented within the primary segment.
 16. The computerprogram product of claim 15, wherein the selectable feature modifies atleast one characteristic of content visibly presented within the primarysegment, and the program instructions further enable the processor toperform the function of modifying the characteristic of the content atthe IST device in response to the input by transmitting a correspondingcontent settings update indication to the IST device via the videocommunication session.
 17. The computer program product of claim 15,wherein the processor modifies the characteristic of content for atleast one of: the electronic device only; and all connected devicesparticipating in the video communication session that receive theprimary segment from the IST device.
 18. The computer program product ofclaim 15, wherein the function modifies one or more of: (i)characteristics of content within the primary segment, (ii)characteristics of the primary segment, (iii) characteristics of thevideo communication session, (iv) a local device setting, and (v) asetting of the IST device.
 19. The computer program product of claim 15,wherein one of the secondary segments comprises a virtual white boardand the input activates the white board for content sharing between theelectronic device and at least the IST device.
 20. The computer programproduct of claim 16, wherein the received segmented video streamcomprises a preview window identifying which segments from among theprimary segment and the at least one secondary segment are received byeach second device connected to the video communication session thatreceives video content from the IST device.